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METHODS. COMPOSITIONS AND KITS 
FOR THE DETECTION AND MONITORING OF LUNG CANCER 

TECHNICAL FIELD OF THE INVENTION 

The present Invention relates generally to the field of cancer 
diagnostics. . More specifically, the present Invention relates to methods, 
compositions and kits for the detection of lung cancer in patients with different 
type, stage and grade of tumors that employ oligonucleotide hybridization and/or 
amplification to simultaneously detect two or more tissue-specific polynucleotides 
in a biological sample suspected of containing lung cancer cells. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

Lung cancer remains a significant health problem throughout the 
world. The failure of conventional lung cancer treatment regimens can commonly 
be attributed, in part, to delayed disease diagnosis. Although significant advances 
have been made in the area of lung cancer diagnosis, there still remains a need for 
improved detection methodologies that penriit early, reliable and sensitive 
determination of the presence! of lung cancer cells. 

EJescription of thie Related Art 

Lung cancer has the highest mortality rate of any of the cancers and 
is one of the most difficult to diagnose early. There are an estimated 1 million 
deaths annually woridwide for this disease. According to the American Cancer 
Society in 2002 alone there were an estimated 169,200 new cases diagnosed and 
~ 154.900 deaths. Typically lung cancers are classified into two major types: Non- 
Small Cell Lung Carcinomas (NSCLC) comprising squamous, adeno and large cell 
carcinomas and Small Cell Lung Carcinomas (SCLC). These groups represent 
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-75% and 25% of aH lung tumors respectively with adenocarcinoma and 
squamous cell carcinoma tieing the most prevalent forms of NSCLC with large cell 
carcinomas being -10%. Within the group of NSCLC. adenocarcinoma is currently 
the most prominent fomi of lung cancer in younger persons, women of all ages. 
5 lifetime nonsmokers and long-temi former smokers. SCLC typically fall into two 
subtypes oat cell and intermediate cell. Less common tumors Include carcinoid 
and mesotheliomas among others but these represent only a small percentage of 
all lung tumors. In almost all cases early diagnosis of NSCLC is elusive and most 
lung cancers have already metastasized by the time they are detected. Only 
10 16.7% are localized on initial diagnosis. If tumors can be detected at a point where 
they are confined then the combination of chemotherapy and radiation has a 
possibility of success but overall the 5year prognosis is very poor with only 10-15% 
survival rate. The picture with SCLC is even bleaker only 6% localized at Initial 
diagnosis and with 5 year survival rates of -6%. 

^-ray and computer tomography of the chest and abdomen are 
frequently used in diagnosis of lung tumors but lack sensitivity for detecting small 
foci and usually detect tumors that have already metastasized, Sputum cytology 
as a potential screening method in high-risk individuals has only been partially 
effective and often does not yield tumor type. To stage the disease CAT scan. 
20 MRI or bone scans are used to evaluate the spread of disease. Treatment for lung 
cancer is typically surgical, radiological or chemotherapy or combinations thereof, 
but usually with poor outcome due to the late diagnosis of disease. 

The current tests for lung cancer lack either the clinical sensitivity to 
detect eariy tumors or provide inadequate stage/grade infomiation or lack tumor 
25 specificity due to their originating from other tumor types or being present in benign 
lung disorders. There is therefore a need to develop specific tests that can 
improve lung cancer diagnosis and prognosis and potentially differentiate between 
NSCLC and SCLC. The present Invention achieves these and other related 
objectives by providing methods that are useful for the Identification of tissue- 
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specific poiynucieob'des. in particular tumor-specific polynucleotides, as well as 
antibodies and methods, composKions and kits for the detection and monitoring of 
cancer cells in a patient afflicted with the disease. 

BRIEF SUMMARY OF THE INVENTION 
5 The present invention provides methods for detecting the presence 

of lung cancer cells in a patient. Such methods comprise the steps of: (a) 
obtaining a biological sample from the patient; (b) contacting the biological sample 
with two or more oligonucleotide pairs specific for independent polynucleotide 
sequences which are unrelated to one another, wherein the oligonucleotide pairs 
1 0 hybridize, under moderately stringent conditions, to their respective 

polynucleotides and the complements thereof (c) amplifying the polynucleotides; 
and (d) detecting the amplified polynucleotides; wherein the piesence of one or 
more of the amplified polynucleotides indicates the presence of lung cancer cells in 
the patient. 

15 By some embodiments, detection of the amplified polynucleotides 

may be preceded by a fractionation step sudri as. for example, gel electrophoresis. 
Alternatively or additionally, detection of the amplified polynucleotides may be ' 
achieved by hybridization of a labeled oligonucleotide probe that hybridizes 
specifically, under moderately stringent conditions, to such polynucleotides. 

20 Oligonucleotide labeling may be achieved by incorporating a radiolabeled 
nucleotide or by incorporating a fluorescent label. 

In certain preferred embodiments, cells of a specific tissue type may 
be enriched from the biological sample prior to the steps of detection. Enrichment 
may be achieved by a methodology selected from the group consisting of ceil 

25 capture and cell depletion. Exemplary cell capture methods include 

immunocapture and comprise the steps of: (a) adsorising an antibody to a tissue- 
specific cell surface to cells said biological sample; (b) separating the antibody 
adsoribed tissue-specific cells finom the remainder of the biological sample. 
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Exemplary cell depletion may be achieved by cross-linking red cells and white cells 
followed by a subsequent fractionation step to remove the cross-linked cells.xxx 
Alternative embodiments of the present invention provide methods 
for determining the presence or absence of lung cancer In a patient, comprising 
5 the steps of: (a) contacting a biological sample obtained from the patient with two 
or more oligonucleotides ttiat hybridize to two or more polynucleotides that encode 
two or more lung tumor proteins; (b) detecting in the sample a level of at least one 
of the polynucleotides (such as, for example, mRNA) that hybridize to the 
oligonucleotides; and (c) comparing the level of polynucleotides that hybridize to 

10 tiie oligonucleotides with a predetermined cut-off value, and therefrom determining 
the presence or absence of lung cancer in the patient. Within certain 
embodiments, the amount of mRNA Is detected via polymerase chain reaction 
using, for example, at least one oligonucleotide primer that hybridizes to a 
polynucleotide encoding a polypeptide as recited above, or a complement of such 

15 a polynucleotide. Within other embodiments, the amount of mRNA Is detected 
using a hybridization technique, employing an oligonucleotide probe that 
hybridizes to a polynucleotide that encodes a polypeptide as recited atK>ve, or a 
complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the 

20 progression of lung cancer in a patient, comprising the steps of: (a) contacting a 
biological sample obtained from a patient with two or more oligonucleotides that 
hybridize to two or more polynucleotides that encode lung tumor proteins; (b) 
detecting in the sample an amount of the polynucleotides that hybridize to the 
oligonucleotides; (c) repeating steps (a) and (b) using a biological sample obtained 

25 from the patient at a subsequent point in time; and (d) comparing the amount of 
polynucleotide detected in step (c) with the amount detected In step (b) and 
therefrom monitoring the progression of the cancer in the patient 




Certain embodiments of the present invention provide that the step of 
amplifying said first polynucleotide and said secortd polynucleotide is achieved by 
the polymerase chain reaction (PGR). 

The present invention also provides kits that are suitable for 
5 perfomiing the detection methods of the present invention. Exemplary kits 

comprise oligonucleotide primer pairs each one of which specifically hybridizes to 
a distinct polynucleotide. Within certain embodiments, kits according to the 
present invention may also comprise a nucleic acid polymerase and suitable 
buffer. 

10 These and other aspects of the present invention will become 

apparent upon reference to the following detailed description and attached 
drawings. All references disclosed herein are hereby incorporated by reference in 
their entirety as if each was incorporated individually. 

BRIEF DESCRIPTION OF SEQUENCE IDENTIFIERS 

15 SEQ ID NO: 1 is the determined cDNA sequence L762P. 

SEQ ID NO: 2 is the amino add sequence encoded by the sequence 
of SEQ ID NO: 1. 

SEQ ID NO: 3 is the determined cDNA sequence L984P. 
SEQ ID NO: 4 Is the amino add sequence encoded by the sequence 
20 of SEQ ID NO: 3. 

SEQ ID NO: 5 Is the detemiined cDNA sequence 550S. 
SEQ ID NO: 6 is the amino add sequence encoded by the sequence 
of SEQ ID NO: 5. 

. SEQ ID NO: 7 Is the determined cDNA sequence L552S. 
25 SEQ ID NO: 8 is the amino acid sequence encoded by the sequence 

of SEQ ID NO: 7. 

SEQ ID NO:9 is the DMA sequence of L552S INT fonvard primer. 
SEQ ID NO:10 is the DNA sequence of L552S INT reverse primer. 
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SEQ ID NO:11 is the DNA sequence of L552S Taqman probe. 
SEQ ID NO:12 Is the DNA sequence of L550S INT forward primer. 
SEQ ID NO:1 3 is the DNA sequence of L550S INT reverse primer. 
SEQ ID Nd:14 is the DNA sequence of L550S Taqman probe. 
5 SEQ ID NO:1 5 is the DNA sequence of L726P INT forwartl primer. 

SEQ ID NO:16 is the DNA sequence of L726P INT reverse primer. 
SEQ ID NO:1 7 is the DNA sequence of L726P Taqman probe. 
SEQ ID NO:18 is the DWK sequence of L984P INT fooA/ard primer. 
SEQ ID NO:19 is the DNA sequence of L984P INT reverse primer. 
1 0 SEQ ID NO:20 is the DNA sequence of L984P Taqman probe. 

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is directed generally to 
methods that are suitable for the Identification of tissue-specific polynucleotides as 
well as to methods, compositions and kits that are suitable for the diagnosis and 

15 monitoring of lung cancer, in particular such methods, compositions and kits are 
suitable for use In the diagnosis, differentiation and/or prognosis of NSCLC and 
SCLC. Such diagnostic methods will form the basis for a molecular diagnostic 
test for detecting lung cancer metastases in lung tissue and for the detection of 
anchorage independent lung cancer cells in blood as well as in mediastinal lymph 

20 nodes of distant metastases. 

A variety of genes have been identified as over-expressed in lung 
tumors, In particular squamous or adeno fonns of NSCLC or small cell carcinomas. 
These include, but are not limited to: L762P, L984P. L550S/L548S. L552S/L647S, 
L552/L547S, L200T, L514S, L551S, L587S, L763S. L773P, L801P. L985P. 

25 L1 058C. LI 081 C, L523S, OF1 783P, B307D (WlPO International Patent 

Application Nos: WO 99/47674. published September 23, 1999; WO 00/61612, 
published October 19. 2000; WO 02/00174, published January 3. 2002; WO 
02/47534, published June 20, 2002; WO 01/72295, published October 4. 2001 ; 
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WO 02/092001, published November 21, 2002; WO 01/00828. published January 
1 . 2001 ; WO 02/04514. published January 1 7. 2002; WO 01/92525. published 
December 6, 2002; WO 02/02623, published January 10. 2002. US Patent Nos: 
Wang et al., 6.482.597, Issued November 22. 2002; Wang et al.. 6.518,256, Issued 
5 February 1 1 , 2003; Wang et al.. 6.426,072, issued July 30, 2002; Reed et al.. 
6,210,883, issued April 3. 2001; Wang et al., 6.504.010, issued January 7, 2003; 
Wang et al., 6,509,448. Issued January 21, 2003. Wang etal; Oncogene; 
2f (49):7598-604. 2002 (collagen type XI alpha 1 ).). 

These genes were identified and characterized using PGR and cDNA 

1 0 library subtractions as well as electronic subtractions with each of the tumor types 
IndMdually. The cDNAs identified were then evaluated by microarray then by Real 
Time PGR on tissue panels to identify specific expression patterns. Table 1 
highlights the specificity of these genes for either adeno or squamous forms of 
NSGLG or both as well as genes specific for small cell lung carcinomas. In some 

1 5 cases reactivity with large cell cardnomas has also been identified by Real Time 
PGR analysis. 



Table 1 



Gene 


Squamous 


Adeno 


Small cell 


Large cell 


Nomial 
Lung 


L762P 


+++++ 


+ 








L984P 




+ 


+++ 






L550S/L548S 




+++++ ' 


+ 






L552S/L547S 


++ 


+++++ 








L200T 




++ 




++ 




L514S 


++++ 


++++ 








L551S 




++++ 




+/- 




L587S 


+ 


+ 


+++ 


+ 




L763P 


+++++ 










L773P 


+++ 


+++ 








L801P 


++++ 


++++ 




++ 




L978P 


+ 


++ 


+++++ 


+/- 




L985P 




+ 


+++++ 






L1058C 




I++ 
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L1081C 






++ 






L523S 


+++++ 


+++++ 


+ 


++ 




OF 1783P 






+++++ 






B307D 


++ 


++ 




+ 





Identification of Tissue-specific Polynucleotides ' 

Certain ennbodiments of the present invention provide methods, 
compositions and l^its for the detection of lung cancer cells within a biological 
5 sample from patients with different type, stage and grade of tumors. These 
methods comprise the step of detecting one or more tissue-specific 
polynucleotide(s)from a patient's biological sample the over-expression of which 
polynucleotides indicates the presence of lung cancer cells within the patient's 
biological sample. Accordingly, the present Invention also provides methods that 

1 0 are suitable for the identification of tissue-specific polynucleotides. As used 
herein, the phrases "tissue-specific polynucleotides" or 'tumor-specific 
polynucleotides" are meant to include all polynucleotides that are at least two-fold 
over-expressed as compared to one or more control tissues. As discussed in 
further detail herein below, over-expression of a given polynucleotide may be 

15 assessed, for example, by microanray and/or quantitative real-time polymerase 
chain reaction (Real-time PGR™) methodologies. 

Exemplary methods for detecting tissue-specific polynucleotides may 
comprise the steps of: (a) performing a genetic subtraction to identify a pool of 
polynucleotides from a tissue of interest; (b) perfomiing a DNA microarray analysis 

20 to identify a first subset of said pool of polynucleotides of interest wherein each 
member polynucleotide of said first subset is at least two-fold over-expressed in 
said tissue of interest as compared to a control tissue; and (c) performing a 
quantitative polymerase chain reaction analysis on polynucleotides within said first 
subset to identify a second subset of polynucleotides that are at least two-fold 

25 over-expressed as compared to said control tissue. 
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Polynucleotides Generally 

As used herein, the term "polynucleotide" refers generally to either 
DNA or RNA molecules. Polynucleotides may be naturally occuning as normally 
found in a biological sample such as blood» serum, lymph node, bone marrow, 
5 sputum, urine and tumor biopsy samples. Alternatively, polynucleotides may be 
derived synthetically by, for example, a nucleic acid polymerization reaction. As 
will be recognized by the skilled artisan, polynucleotides may be single-stranded 
(coding or antisense) or double-stranded, and may be DNA (genomic, cDNA or 
synthetic) or RNA molecules. RNA molecules include HnRNA molecules, which 

10 contain introns and correspond to a DNA molecule in a one-to-one manner, and 
mRNA molecules, which do not contain introns. Additional coding or non-coding 
sequences may, but need not, be present within a polynucleotide of the present 
invention, and a polynucleotide may, but need not, be linked to other molecules 
and/or support materials. 

15 Polynucleotides may comprise a native sequence {I.e., an 

endogenous sequence that encodes a tumor protein, such as a lung tumor protein, 
or a portion thereof) or may comprise a variant, or a biological or antigenic 
functional equivalent of such a sequence. Polynucleotide variants may contain 
one or more substitutions, additions, deletions and/or insertions, as further 

20 described below. The term 'Variants" also encompasses homologous genes of 
xenogenic origin. 

When comparing polynucleotide or polypeptide sequences, two 
sequences are said to be "identical" if the sequence of nucleotides or amino acids 
in the two sequences is the same when aligned for maximum correspondence, as 

25 described below. Comparisons between two sequences are typically performed by 
comparing the sequences over a comparison window to identify and compare local 
regions of sequence similarity. A "comparison window" as used herein, refers to a 
segment of at least about 20 contiguous positions, usually 30 to about 75, 40 to 
about 50, In which a sequence may be compared to a reference sequence of the 
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same number of contiguous positions after the two sequences are optimally 
aligned. 

Optimal alignment of sequences for comparison may be conducted 
using the Megalign program In the Lasergene suite of biolnfbnnatlcs software 
(DNASTAR. Inc.. Madison, Wl), using default parameters. This program embodies 
several alignment schemes described in the following references: Dayhoff, M.O. 
(1 978) A model of evolutionary change In proteins - Matrices for detecting distant 
relationships. In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure. 
National Biomedical Research Foundation, Washington DC Vol. 5, Suppl. 3. pp. 
345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenes pp. 626- 
645 Methods in Enzymologywol 183, Academic Press, Inc., San Diego, OA; 
Higgins. D.G. and Sharp, P.M. (1989) CABIOS 5:151-163; Myere, E.W. and Muiler 
W. (1988) C/\e/OS 4:1 1-17; Robinson, E.D. (1971) Comb. Theor 11:W5; Santou, 
N. Nes. M. (1987) MoL Biol. Evo/. 4:406-425; Sneath. P.H.A. and Sokal, R.R. 
(1973) Numerical Taxonomy- the Principles and Practice of Numerical Taxonomy, 
Freeman Press, San Francisco, CA; Wilbur, VV.J. and Lipman, D.J. (1983) Proc. 
Natl. Acad., Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local Identity algorithm of Smith and Waterman (1 981 ) Add. APL 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) 
J. Mol. Biol. 48:443, by the search for similarity methods of Pearaon and Lipman 
(1988) Proc. Natl. Acad. Sci. USA 85: 2444. by computerized implementations of 
these algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., 
Madison, Wl), or by inspection. 

One prefen^d example of algorithms that are suitable for detemiining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389- 
3402 and Altschul et al. (1990) J. Mol. Biol. 215:403-410. respectively. BLAST and 
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BLAST 2,0 can be used, for example with the parameters described herein, to 
detemnlne percent sequence identity for the polynucleotides and polypeptides of 
the Invention. Software for perfomiing BLAST analyses is publicly available 
through the National Center for Biotechnology Information. In one illustrative 
example, cumulative scores can be calculated using, for nucleotide sequences, the 
parameters M (reward score fora pair of matching residues; always >0) and N 
(penalty score for mismatching residues; always <0). For amino acid sequences, a 
scoring matrix can be used to calculate the cumulative score. Extension of the 
word hits in each direction are halted when: the cumulative alignment score falls 
off by the quantity X from its maximum achieved value; the cumulative score goes 
to zero or below, due to the accumulation of one or more negative-scoring residue 
alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T and X determine the sensitivity and speed of the alignment. The 
BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 
1 1 . and expectation (E) of 10. and the BLOSUM62 scoring matrix (see Henikoff 
and Henikoff (1989) Proc. Natl. Acad. ScL USA 89:10915) alignments, (B) of 50, 
expectation (E) of 10, M=5. N=-4 and a comparison of both strands. 

Preferably, the "percentage of sequence identity" is detennined by 
comparing two optimally aligned sequences over a window of comparison of at 
least 20 positions, wherein the portton of the polynucleotide or polypeptide 
sequence in the comparison window may comprise additk>ns or deletions {i.e., 
gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 percent, as 
compared to the reference sequences (which does not comprise additions or 
deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical nucleic 
acid bases or amino acid residue occurs in both sequences to yield the number of 
matched positions, dividing the number of matched positions by the total number 
of positions in the reference sequence (i.e., the window size) and multiplying the 
results by 100 to yield the percentage of sequence kJentity. 



Therefore, the present invention encompasses polynucleotide and 
polypeptide sequences having substantial identity to the sequences disclosed 
herein, for example those comprising at least 50% sequence identity, preferably at 
least 55%. 60%. 65%. 70%, 75%, 80%. 85%, 90%, 95%. 96%. 97%. 98%. or 99% 
5 or higher, sequence identity compared to a polynucleotide or polypeptide 
sequence of this invention using ttie mettiods described herein, (e.g:, BLAST 
analysis using standard parameters, as described below). One skilled in this art 
will recognize that ttiese values can be appropriately adjusted to determine 
coniesponding Identity of proteins encoded by two nucleotide sequences by taking 
10 into account codon degeneracy, amino acid similarity, reading frame positioning 
and the like. 

In additional embodiments, the present invention provides isolated 
polynucleotides and polypeptides comprising various lengths of contiguous 
stretches of sequence identical to or complementary to one or more of the 

1 5 sequences disctosed herein. For example, polynucleotides are provkJed by this 
invention that comprise at least about 15, 20. 30, 40, 50, 76. 100. 150. 200, 300, 
400, 500 or 1 000 or more contiguous nucleotides of one or more of the sequences 
disclosed herein as well as all Intermediate lengths there between. It will be 
readily understood that "Intemiediate lengths", In this context, means any length 

20 between the quoted values, such as 16, 1 7, 18. 19, etc.; 21 . 22, 23. etc.; 30, 31 , 
32. etc.; 50. 51. 52, 53, efc; 100. 101, 102. 103, eto.; 150. 151. 152. 153, ©fc; 
including all Integers through 200-500; 500-1,000, and the like. 

The polynucleotides of the present Invention, or fragments thereof, 
regardless of ttie length of the coding sequence itself, may be combined with other 

25 DNA sequences, such as promoters, polyadenylation signals,, additional restriction 
enzyme sites, multiple cloning sites, other coding segments, and the like, such that 
their overall lengtii may vary considerably. It is therefore contemplated that a 
nucleic acid fragment of almost any length may be employed, with the total lengtti 
preferably being limited by the ease of preparation and use in the iiitended 
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recombinant DNA protocol. For example. Illustrative DNA segments with total 
lengths of about 10,000, about 6000. about 3000, about 2.000. about 1.000. about 
500. about 200, about 100. about 50 base pairs In length, and the like, (including 
all intermediate lengths) are contemplated to be useful In many implementations of 
5 this Invention. 

In other embodiments, the present invention is directed to 
polynucleotides that are capable of hybridizing under moderately stringent 
conditions to a polynucleotide sequence provided herein, or a fragment thereof, or 
a complementary sequence thereof. Hybridization techniques are well known in 

1 0 the art of molecular biology. For purposes of illustration, suitable moderately 

stringent conditions for testing the hybridization of a polynucleotide of this invention 
with other polynucleotides include prewashing in a solution of 5 X SSC. 0.5% SDS. 
1 .0 mM EDTA (pH 8.0); hybridizing at SO'C-OS-C. 5 X SSC, pvemight; followed by 
washing twice at 66X for 20 minutes with each of 2X, 0.5X and 0.2X SSC 

1 5 containing 0. 1 % SDS. 

Moreover, it will be appreciated by those of ordinary skill in the art 
that, as a result of the degeneracy of the genetic code. therB are many nucleotide 
sequences that encode a polypeptide as described herein. Some of these 
polynucleotides bear minimal homotogy to the nucleotWe sequence of any native 

20 gene. Nonetheless, polynucleotides that vary due to differences In codon usage 
are speciffcally contemplated by the present invention. Further, alleles of the 
genes comprising the polynucleotide sequences provided herein are within the 
scope of the present invention. Alleles are endogenous genes that are altered as 
a result of one or more mutations, such as deletions, additions and/or substitutions 

25 of nucleotides. The resulting mRNA and protein may. but need not. have an 

altered structure or function. Alleles may be Identified using standard techniques 
(such as hybridization, amplification and/or database sequence comparison). 
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Microarrav Analyses 

Polynucleotides that are suitable for detection according to the 
methods of the present invention may be identified, as described In more detail 
below, by screening a microarray of cDNAs for tissue and/or tumor-associated 
5 expression (e.g., expression that is at least two*fold greater in a tumor than in 
nomial tissue, as determined using a representative assay provided herein). Such 
screens may be perfomied, for example, using a Synteni microarray (Palo Alto, 
CA) according to the manufacturer's instructions (and essentially as described by 
Schena etal., Proc. Natl. Acad. ScL USA 93:10614-10619 (1996) and Heller etal., 
10 Proc. Natl. Acad, ScL USA 94:2150-2155 (1997)). 

Microarray is an effective method for evaluating large numbers of 
genes but due to its limited sensitivity it may not accurately determine the absolute 
tissue distribution of low abundance genes or may underestimate the degree of 
overexpression of more abundant genes due to signal saturation. For those genes 
1 5 showing overexpression by microarray expression profiling, further analysis was 
. performed using quantitative RT-PCR based on Taqman™ probe detection, which 
comprises a greater dynamic range of sensitivity. Several different panels of 
normal and tumor tissues, distant metastases and cell lines were used for this 
purpose. 

20 Quantitative Real-time Polymerase Chain Reaction 

Suitable polynucleotides according to the present Invention may be 
further characterized or, alternatively, originally identified by employing a 
quantitative PGR methodology such as. for example, the Real-time PGR 
methodology. By this methodology, tissue and/or tumor samples, such as, e.g., 
25 metastatic tumor samples, may be tested along side the corresponding normal 
tissue sample and/or a panel of unrelated normal tissue samples. 

Real-time PGR (see Gibson et al.^Genome Research 6:995-1001 , 
1996; Held et al., Genome Research 6:986-994, 1996) is a technique that 
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eyaluates the level of PGR product accumulation during amplification. This 
technique permits quantitative evaluation of mIRNA levels in multiple samples^ 
Briefly. mRNA is extracted from tumor and nonnal tissue and cDNA is prepared 
using standard techniques. 
5 Real-time PGR may. for example, be perfomned either on the ABI 

7700 Prism or on a GeneAmp® 5700 sequence detection system (PE Biosystems, 
Foster Gity. GA). The 7700 system uses a forward and a reverse primer In 
combination with a specific probe with a 5' fluorescent reporter dye at one end and 
a 3' quencher dye at the other end (Taqman™). When the Real-tirne PGR is 

10 performed using Taq DNA polymerase with 5-3' nuclease activity, the probe is 
cleaved and begins to fluoresce allowing the reaction to be monitored by the 
increase in fluorescence (Real-time). The 5700 system uses SYBR® green, a 
fluorescent dye. that only binds to double stranded DNA, and the same fonA^ard 
and reverse primers as the 7700 instrument. Matching primers and fluorescent 

1 5 probes may be designed according to the primer express program (PE 

Biosystems, Foster Gity. GA). Optimal concentrations of primers and probes are 
Initially detennlned by those of ordinary skill in the art Gontrol (e.g., p-actin) 
primers and probes may be obtained commercially from, for example. Perkin 
Elmer/Applied Biosystems (Foster Gity. GA). 

20 To quantitate the amount of specific RNA in a sample, a standard 

curve is generated using a plasmid coritalning the gene of interest. Standard 
curves are generated using the Gt values determined In the real-time PGR, which 
are related to the Initial cDNA concentration used in the assay. Standard dilutions 
ranging from 10-10® copies of the gene of interest are generally sufficient. In 

25 addition, a standard curve is generated for the control sequence. This permits 
standardization of initial RNA content of a tissue sample to the amount of control 
for comparison purposes. 

In accordance with the above, and as described further below, the 
present invention provides the lllusta^tive lung tissue- and/or tumor-specific 
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polynucleotides L552S. L550S, L762P and L984P having sequences set forth in 
SEQ ID NOs: 1. 3, 5 and 7, illustrative polypeptides encoded thereby having amino 
acid sequences set forth in SEQ ID NO: 2, 4. 6 and 8 that may be suitably 
employed In the detection of cancer, more specifically, lung cancer. 

Methodologies for the Detection of Cancer 

In general, a cancer cell may be detected in a patient based on the 
presence of one or more polynucleotides within cells of a biological sample (for 
example, blood, lymph nodes, bone marrow, sera, sputum, urine and/or tumor 
biopsies) obtained from the patient. In other words, such polynucleotides may be 
used as markers to indicate the presence or absence of a cancer such as, e.g., 
lung cancer. 

As discussed in further detail herein, the present invention achieves 
these and other related objectives by providing a methodology for the 
simultaneous detection of more than one polynucleotide, the presence of which is 
diagnostic of the presence of lung cancer cells in a biological sample. Each of the 
various cancer detection methodologies disclosed herein have In common a step 
of hybridizing one or more oligonucleotide primers and/or probes, the hybridization 
of which is demonstrative of the presence of a tumor- and/or tissue-specific 
polynucleotide. Depending on the precise application contemplated, It may be 
preferred to employ one or more intron-spanning oligonucleotides that are 
inoperative against polynucleotide' of genomic DNA and, thus, these 
oligonucleotides are effective In substantially reducing and/or eliminating the 
detection of genomic DNA in the biological sample. 

Further disclosed herein are methods for enhancing the sensitivity of 
these detection methodologies by subjecting the biological samples to be tested to 
one or more cell capture and/or cell depletion methodologies. 

By certain embodiments of the present invention, the presence of 
lung cancer cell in a patient may be determined by employing the following steps: 
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(a) contacting a biological sample obtained from the patient with two or^more 
oligonucleotides that hybridize to two or more polynucleotides that encode two or 
more lung tumor proteins as described herein; (b) detecting in the sample a level 
of at least one of the polynucleotides (such as, for example, mRNA) that hybridize 
to the oligonucleotides; and (c) comparing the level of polynucleotides that 
hybridize to the oligonucleotides with a predetermined cut-off value, and therefrom 
determining the presence or absence of lung cancer In the patient. 

To permit hybridization under assay conditions, oligonucleotide 
primers and probes should comprise an oligonucleotide sequence that has at least 
about 60%, preferably at least about 75% and more preferably at least about 90%, 
Identity to a portion of a polynucleotide encoding a lung tumor protein that is at 
least 10 nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers hybridize to .a polynucleotide encoding a polypeptide 
described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers which may be usefully employed in the diagnostic 
methods described herein preferably are at least 10-40 nucleotides in length. In a 
preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA 
molecule having a sequence recited in SEQ ID NO: 1 , 3, 5 or 7. Techniques for 
both PGR based assays and hybridization assays are well known in the art (see. 
for example, Mullisef a/., Cold Spring Harbor Symp. Quant BioL, 5f:263, 1987; 
Erilch ed., PCR Technology, Stockton Press. NY, 1989). 

The present invention also provides amplification-based methods for 
detecting the presence of lung cancer cells In a patient. Exemplary methods 
comprise the steps of (a) obtaining a biological sample from the patient; (b) 
contacting the biological sample with two or more oligonucleotide pairs specific for 
independent polynucleotide sequences which are unrelated to one another, 
wherein the oligonucleotide pairs hybridize, under moderately stringent conditions, 
to their respective polynucleotides and the complements thereof ,(c) amplifying the 
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polynucleotides; and (d) detecting the amplified polynucleotides; wherein the 
presence of one or more of the amplified polynucleotides Indicates the presence of 
lung cancer cells in the patient. 

Methods according to the present invention are suitable for 
Identifying polynucleotides obtained from a wide variety of biological sample such 
as, for example, blood, serum, lymph node, bone marrow, sputum, urine and tumor 
biopsy sample, among others. 

Certain exemplary embodiments of the present invention provide 
methods wherein the polynucleotides to be detected are selected from the group 
consisting of L762, L984. L550. and L552. Alternatively and/or additionally, 
polynucleotides to be detected may be selected fiom the group consisting of those 
depicted in SEQ ID NOs: 1 , 3. 5, and 7. 

Suitable exemplary oligonucleotide probes and/or primers that may 
be used according to the methods of the present Invention are disclosed herein. 
In certain prefenred embodiments that eliminate the background detection of 
genomic DNA, the oligonucleotides may be intron spanning oligonucleotides. 

Depending on the precise application contemplated, the artisan may 
prefer to detect the tissue- and/or tumor-specific polynucleotides by detecting a 
radiolabel and detecting a fluorophore. More specifically, the oligonucleotide 
probe and/or primer may comprises a detectable moiety such as, for example, a 
radiolabel and/or a fluorophore. 

Alternatively or additionally, methods of tfie present invention may 
also comprise a step of fractionation prior to detection of the tissue- and/or tumor- 
specific polynucleotides such as, for example, by gel electrophoresis. 

In other embodiments, methods described herein may be used as to 
monitor the progression of cancer. By these embodiments, assays as provided for 
the diagnosis of lung cancer may be peri'ormed over time, and the change in the 
level of reactive polypeptide(s) or polynucleotide(s) evaluated. For example, ttie 
assays may be performed every 24-72 hours for a period of 6 months to 1 year. 
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and thereafter performed as needed. In general, a cancer is progressing in those 
patients in whom the level of polypeptide or polynucleotide detected increases 
over tinfie. In contrast, the cancer Is not progressing when the level of reactive 
polypeptide or polynucleotide either remains constant or decreases with time. 

Certain In vivo diagnostic assays may be performed directly on a 
tumor. One such assay Involves contacting tumor cells with a binding agent The 
bound binding agent may then be detected directly or indirectly via a reporter 
group. Such binding agents may also be used in histological applications. 
Altematively, polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple lung tumor protein 
markers may be assayed within a given sample. It will be apparent that binding 
agents specific for different proteins provided herein may be combined within a 
single assay. Further, multiple primers or probes may be used concurrently. The 
selection of tumor protein markers may be based on routine experiments to 
determine combinations that results in optimal sensitivity. In addition, or 
alternatively, assays for tumor proteins provided herein may be combined with 
assays for other known tumor antigens. 

Cell Enrichment 

In other aspects of the present invention, cell capture technologies 
may be used prior to polynucleotide detection to Improve the sensitivity of the 
various detection methodologies disclosed herein. 

Exemplary cell enrichment methodologies employ immunomagnetic 
beads that are coated with specific monoclonal antibodies to surface cell markers, 
or tetrameric antibody complexes, may be used to first enrich or positively select 
cancer cells in a sample. Various commercially available kits may be used, 
including Dynabeads® Epithelial Enrich (Dynal Biotech, Oslo, Norway), 
StemSep™ (StemCell Technologies, Inc., Vancouver, BO), and RosetteSep 
(StemCell Technologies). The skilled artisan will recognize that other readily 
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available methodologies and kits may also be suitably employed to enrich or 

positively select desired cell populations. 

Dynabeads® Epithelial Enrich contains magnetic beads coated with 

mAbs specific for two glycoprotein membrane antigens expressed on normal and 
5 neoplastic epithelial tissues, the coated beads may be added to a sample and the 

sample then applied to a magnet, thereby capturing the cells bound to the beads. 

The unwanted cells are washed away and the magnetically isolated cells eluted 

from the beads and used in further analyses, 

RosetteSep can be used to enrich cells directly from a blood sample 
1 0 and consists of a cocktail of tetrameric antibodies that target a variety of unwanted 

cells and crosslinks them to glycophorin A on red blood cells (RBC) present in the 

sample, forming rosettes. When centrifuged over Ficoll, targeted cells pellet along 

with the free RBC. 

The combination of antibodies in the depletion cocktail determines 
15 which cells will be removed and consequently which cells will be recovered. 

Antibodies that are available include, but are not limited to: CD2, CDS, CD4, CDS. 

CDS, CD10, CD11b. CD14, CD15, CD16. GDI 9, CD20, CD24, CD25, CD29, 

CD33, CD34, CD36, CD38, CD41, CD45, CD45RA. CD45RO, CD66, CD66B, 

CD66e, HLA-DR, IgE, and TCRop. Additionally, it is contemplated In the present 
20 invention that mAbs specific for lung tumor antigens, can be developed and used 

In a similar manner. For example, mAbs that bind to tumor-specific cell surface 

antigens may be conjugated to magnetic beads, or formulated in a tetrameric 

antibody complex, and used to enrich or positively select metastatic lung tumor 

cells from a sample. Such a system can be used to evaluate blood samples from 
is different fomis of lung cancers, in particular adneo and squamous forms of NSCLC 

and small cell carcinomas for the presence of circulating tumor cells using the 

inventive multiplex PCR assay as described herein. 

Once a sample is enriched or positively selected, cells may be further 

analyzed. For example, the cells may be lysed and RNA isolated. RNA may then 
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be subjected to RT-PCR analysis using lung tunnor-specific multiplex primers in a 
Real-time PGR assay as described herein. 

in another aspect of the present Invention, ceil capture technologies 
may be used in conjunction with Real-Time PGR to provide a more sensitive tool 
5 for detection of metastatic cells expressing lung tumor antigens. 

Yet another method that may be employed is an anti-ganglioside 
Gmi/Gmi cell capture antibody system. Gangliosides are cell membrane bound 
glycosphingolipids, several species of which have been shown to be over- 
expressed on the cell surface of most cancers of neuroectodermal and epithelial 
10 origin, in particular lung cancer Cell surface expression of Gm2 is seen in several 
types of lung cancer, particulariy in SCLC which make it an attractive target for a 
monoclonal antibody based lung cancer immunotherapy and also for use as a 
capture method in conjunction with Gmi. 



Probes and Primers 

15 As noted above and as described In further detail herein, certain 

methods, compositions and kits according to the present invention utilize two or 
more oligonucleotide primer pairs for the detection of lung cancer. The ability of 
such nucleic acid probes to specifically hybridize to a sequence of interest will 
enable them to be of use in cletecting the presence of complementary sequences 

20 in a biological sample. 

Alternatively, in other embodiments, the probes and/or primers of the 
present invention may be employed for detection via nucleic acid hybridization. As 
such, it is contemplated that nucleic acid segments that comprise a sequence 
region of at least about 15 nucleotide long contiguous sequence that has the same 

25 sequence as, or is complementary to, a 1 5 nucleotide long contiguous sequence of 
a polynucleotide to be detected will find particular utility. Longer contiguous 
Identical or complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 
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200, 500, 1000 (including all intemnediate lengths) and even up to full length 
sequences will also be of use in certain embodiments. 

Oligonucleotide primers having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20. 30, 50, or even of 100t200 
nucleotides or so (including intemnediate lengths as well), identical or 
complementary to a polynucleotide to be detected, are particularly contemplated 
as primers for use in amplification reactions such as, e.g., the polymerase chain 
reaction (PGR™). This would allow a polynucleotide to be analyzed, both in 
diverse biological samples such as, for example, blood, lymph nodes and bone 
marrow. 

The use of a primer of about 1 6-25 nucleotides in length allows the 
formation of a duplex molecule that is both stable and selective. Molecules having 
contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though. In order to increase stability and selectivity 
of the hybrid, and thereby improve the quality and degree of specific hybrid 
molecules obtained. One will generally prefer to design primers having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Primers may be selected from any portion of the polynucleotide to be 
detected. All that is required is to review the sequence, such as those exemplary 
polynucleotides set forth herein or to any continuous portion of the sequence, from 
about 15-25 nucleotides in length up to and including the full length sequence, that 
one wishes to utilize as a primer. The choice of primer sequences may be 
governed by various factors. For example, one may wish to employ primers from 
towards the termini of the total sequence. The exemplary primers disclosed herein 
may optionally be used for their ability to selectively form duplex molecules with 
complementary stretches of the entire polynucleotide of interest such as those set 
forth SEQ ID NOs: 1 , 3 ,5 or 7. 
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The present invention further provides the nucleotide sequence of 
various exemplary oligonucleotide primers and probes, that may be used, as 
described In further detail herein, according to the methods of the piBsent 
Invention for the detection of cancer. 

Oligonucleotide primers acxxirding to the present Invention may be 
readily prepared routinely by methods commonly available to the skilled artisan 
Including, for example, directly synthesizing the primers by chemical means, as is 
commonly practiced using an automated oligonucleotide synthesizer Depending 
on the application envisioned, one will typically desire to employ varying conditions 
of hybridization to achieve varying degrees of selectivity of probe towards target 
sequence. For applications requiring high selectivity, one will typically desire to 
employ relatively stringent conditions to form the hybrids, e.g.. one will select 
relatively low salt and/or high temperature conditions, such as provided by a salt 
concentration of from about 0.02 M to about 0.15 M salt at temperatures of from 
about 50°C to about 70°C. Such selective conditions tolerate little, if any, 
mismatch between the probe and the template or target strand, and would be 
partlculariy suitable for Isolating related sequences. 

Polvnucleotlde Amplification Techniques 

Each of the specific embodiments outlined herein for the detection of 
lung cancer has in common the detection of a tissue- and/or tumor-specific 
polynucleotide via the hybridization of one or more oligonucleotide primers and/or 
probes. Depending on such factors as the relative number of cancer cells present 
in the biological sample and/or the level of polynucleotide expression within each 
lung cancer cell, it may be preferred to perfbnn an amplification step prior to 
performing the steps of detection. For example, at least two oligonucleotide 
primers may be employed in a polymerase chain reaction (PGR) based assay to 
amplify a portion of a lung tumor cDNA derived from a biological sample, wherein 
at least one of the oligonucleotide primers Is specific for (i.e.. hybridizes to) a 
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polynucleotide encoding the lung tumor protein. The amplified cDNA may 
optionally be subjected to a fractionation step such as, for example, gel 
electrophoresis. 

A number of template dependent processes are available to amplify 
the target sequences of interest present in a sample. One of the best known 
amplification methods Is the polymerase chain reaction (PCRJ*«) which is described 
in detail in U.S. Patent Nos. 4.683,195, 4,683,202 and 4.800,159. Briefly, in 
PCR™, two primer sequences are prepared which are complementary to regions 
on opposite complementary strands of the target sequence. An excess of 
deoxynucleoside triphosphates is added to a reaction mixture along with a DNA 
polymerase (e.g., Taq polymerase). If the taiget sequence is present in a sample, 
the primers will bind to the target and the polymerase will cause the primers to be 
extended along the target sequence by adding on nucleotides. By raising and 
lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to fomn reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PGR™ amplification procedure may be perfomned in order to 
quantify the amount of mRNA amplified. Polymerase chain reaction 
methodologies are well known in the art. 

One prefened methodology for polynucleotide amplification employ 
RT-PCR, In which PGR is applied in conjunction with reverse transcription. 
Typically, RNA is extracted from a biological sample, such as blood, seaim, lymph 
node, bone marrow, sputum, urine and tumor biopsy samples, and is reverse 
transcribed to produce cDNA molecules. PGR amplification using at least one 
specific primer generates a cDNA molecule, whidi may be separated and 
visualized using, for example, gel electrophoresis. Amplification may be perfomied 
on blologfcal samples taken from a patient and from an individual who is not 
afflicted vWth a cancer. The amplification reaction may be performed on several 
dilutions of cDNA spanning two orders of magnitude. A two-fold or greater 
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Increase in expression in several dilutions of the test patient sample as compared 
to the same dilutions of the non-cancerous sample is typically considered positive. 

Any of a variety of commercially available kits may be used to 
perform the amplification step. One such amplification technique is inverse PGR 
5 (see Triglia ef a/., NucL Acids Res. f 6:8186. 1988), which uses restriction enzymes 
to generate a fragment in the known region of the gene. The fragment is then 
circularized by Intramolecular ligation and used as a template for PGR with 
divergent priiners derived from the known region. Within an alternative approach, 
sequences adjacent to a partial sequence may be retrieved by amplification with a 

10 primer to a linker sequence and a primer speclffc to a known region. The amplified 
sequences are typicaUy subjected to a second round of amplification with the same 
linker primer and a second primer specific to the known region. A variation on this 
procedure, which employs two primers that Initiate extension in opposite directions 
from the known sequence, is described in WlPO International Patent Application 

1 5 No.: WO 96/38591 . Another such technique Is known as "rapid amplification of 
cDNA ends" or RACE. This technique involves the use of an internal primer and 
an external primer, which hybridizes to a polyA region or vector sequence, to 
identify sequences that are 5* and 3' of a known sequence. Additional techniques 
include capture PGR (Lagerstrom etaf.. PCR Methods Applic. 7:111-19, 1991) and 

20 walking PGR (Paricer ef a/.. Nucl. Acids. Res. 79:3055-60. 1991). Other methods 
employing amplification may also be employed to obtain a full length cDNA 
sequence. 

Another method for amplification Is the ligase chain reaction (referred 
to as LCR). disclosed In Eur. Pat Appl. Publ. No. 320.308. in LCR, two 
25 complementary probe pairs are prepared, and in the presence of the target 

sequence, each pair will bind to opposite complementary strands of the target such 
that they abut. In the presence of a ligase, the two probe pairs will link to form a 
single unit By temperature cycling, as in PCRTm, bound ligated units dissociate 
from the target and then serve as "target sequences" for ligation of excess probe 
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pairs. U.S. Patent No. 4,883,750, describes an alternative method of amplification 
similar to LCR for binding probe pairs to a target sequence. 

Qbeta Replicase, described in PCT Intl. Pat. Appl. PubL No. 
PCT/US87/00880, may also be used as still another amplification method in the 
5 present invention. In this method, a replicative sequence of RNA that has a region 
complementary to that of a target is added to a sample in the presence of an RNA 
polymerase. The polymerase will copy the replicative sequence that can then be 
detected. 

An isothermal amplification method, in which restriction 
10 endonucleases and Hgases are used to achieve the amplification of target 
molecules that contain nucleotide 5'-[a-thlo]trlphosphates in one strand of a 
restriction site (Walker et aL, 1992), may also be useful in the amplification of 
nucleic acids in the present invention. 

Strand Displacement Amplification (SDA) is another method of 
1 5 canying out isothermal amplification of nucleic acids which involves multiple 
rounds of strand displacement and synthesis, Le., nick translation. A similar 
method, called Repair Chain Reaction (RCR) Is another method of amplification 
which may be useful in the present Invention and is involves annealing several 
probes throughout a region targeted for amplification, followed by a repair reaction 
20 in which only two of the four bases are present. The other two bases can be 

added as biotinylated derivatives for easy detection. A similar approach is used in 
SDA. 

Sequences can also be detected using a cyclic probe reaction 
(CPR). In CPR, a probe having a 3* and 6' sequences of non-target DNA and an 
25 intemal or "middle" sequence of the target protein specific RNA is hybridized to 
DNA which is present in a sample. Upon hybridization, the reaction is treated with 
RNaseH, and the products of the probe are identified as distinctive products by 
generating a signal that is released after digestion. The original template is 
annealed to another cycling probe and the reaction is repeated. Thus, CPR 

26 



involves amplifying a signal generated by hybridization of a probe to a taiget gene 
specific expressed nucleic acid. 

Still other amplification methods described in Great Britain Pat. Appl. 
No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025, may be 
used in accordance with the present invention. In the fonner application, 
"modified" primers are used In a PCR-like, template and enzyme dependent 
synthesis. The primers may be modified by labeling with a capture moiety (e.g., 
biotin) and/or a detector moiety (e.g., enzyme). In the latter application, an excess 
of labeled probes is added to a sample. In the presence of the taiget sequence, 
the probe binds and is cleaved catalytically. After cleavage, the target sequence is 
released intact to be bound by excess probe. Cleavage of the labeled probe 
signals the presence of the target sequence. 

Other nucleic acid amplification procedures include 
transcription-based amplification systems (TAS) (Kwoh ef a/., 1989; PCT Intl. Pat. 
Appl. Publ. No. WO 88/10315), including nucleic acid sequence based 
amplification (NASBA) and 3SR. In NASBA, the nucleic adds can be prepared for 
amplification by standard phenol/chlorofomn extraction, heat denaturatlon of a 
sample, treatment with lysis buffer and minispin columns for isolation of DNA and 
RNA orguanidinlum chloride extraction of RNA. These amplification techniques 
involve annealing a primer that has sequences specific to the target sequence. 
Following polymerization, DNA/RNA hybrids are digested with RNase H while 
double stranded DNA molecules are heat-denatured again. In either case the 
single stranded DNA Is made fully double stranded by addition of second target- 
specific primer, followed by polymerization. The double stranded DNA molecules 
are then mulfiply transcribed by a polymerase such as T7 or SP6. In an isothennal 
cyclic reaction, the RNAs are reverse transcribed into DNA, and transcribed once 
again with a polymerase such as T7 or SP6. The resulting pnxiucts. whether 
truncated or complete, indicate target-specific sequences. 
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Eur. Pat AppI- Publ. No. 329,822, disclose a nucleic acfd 
amplification process involving cyclically synthesizing single-stranded RNA 
("ssRNA"). ssDNA, and double-stranded DNA (dsDNA), which may be used in 
accordance with the present invention. The ssRNA is a first template for a first 
5 primer oligonucleotide, which is elongated by reverse transcriptase 

(RNA-dependent DNA polymerase). The RNA is then removed from resulting 
DNA:RNA duplex by the action of ribonuclease H (RNase H, an RNase specific for 
RNA in a duplex with either DNA or RNA), The resultant ssDNA Is a second 
template for a second primer, which also includes the sequences of an RNA 

10 polymerase promoter (exemplified by T7 RNA polymerase) 5' to its homology to its 
template. This primer is then extended by DNA polymerase (exemplified by the 
large "Klenow" fragment of E. coli DNA polymerase I), resulting as a 
double-stranded DNA ("dsDN A") molecule, having a sequence identical to that of 
the original RNA between the primers and having additionally, at one end, a 

1 5 promoter sequence. This promoter sequence can be used by the appropriate RNA 
polymerase to make many RNA copies of the DNA. These copies can then 
re-enter the cycle leading to very swift amplification. With proper choice of 
enzymes, this amplification can be done isothermally without addition of enzymes 
at each cycle. Because of the cyclical nature of this process, the starting 

20 sequence can be chosen to be in the form of either DNA or RNA. 

PCT Intl. Pat. AppI. Publ. No, WO 89/06700. disclose a nucleic acid 
sequence amplification scheme based on the hybridization of a promoter/primer 
sequence to a target single-stranded DNA ("ssDNA") followed by transcription of 
many RNA copies of the sequence. This schema is not cyclic; /.e., new templates 

25 are not produced from the resultant RNA transcripts. Other amplification methods 
include "I^CE" (Frohman, 1990), and "one-sided PGR" (Ohara, 1989) which are 
well-known to those of skill in the art. 
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Compositions and Kits for the Detection of Cancer ^ 

The present Invention further provides kits for use within any of the 
above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, 
5 reagents, containers and/or equipment. For example, one container within a kit 
may contain a monoclonal antibody or fragment thereof that specifically binds to a 
lung tumor protein. Such antibodies or fragments may be provided attached to a 
support material, as described above. One or more additional containers may 
enclose elements, such as reagents or buffers, to be used in the assay. Such kits 
10 may also, or altematively, contain a detection reagent as described above that 
contains a reporter group suitable for direct or indirect detection of antibody 
binding. 

The present invention also provides kits that are suitable for 
performing the detection methods of the present Invention. Exemplary kits 

15 comprise oligonucleotide primer pairs each one of which specifically hybridizes to 
a distinct polynucleotide. Within certain embodiments, kits according to the 
present invention may also comprise a nucleic acid polymerase and suitable 
buffer. Exemplary oligonucleotide primers suitable for kits of ^he present invention 
are disclosed herein. Exemplary polynucleotides suitable for kits of the present 

20 invention are disclosed herein. 

Alternatively, a kit may be designed to detect the level of mRNA 
encoding a lung tumor protein In a biological sample. Such kits generally comprise 
at least one oligonucleotide probe or primer, as described above, that hybridizes to 
a polynucleotide encoding a lung tumor protein. Such an oligonucleotide may t>e 

25 used, for example, within a PCR or hybridization assay. Additional components 
that may be present within such kits include a second oligonucleotide and/or a 
diagnostic reagent or container to facilitate the detection of a polynucleotide 
encoding a lung tumor protein. 
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In ottier related aspects, the present invention further provides 
compositions useful in the methods disclosed herein. Exemplary compositions 
comprise two or more oligonucleotide primer pairs each one of which specifically 
hybridizes to a distinct polynucleotide. Exemplary oligonucleotide primers suitable 
5 for compositions of the present invention are disclosed herein. Exemplary 
polynucleotides suitable for compositions of the present invention are disclosed 
herein. 

The following Example is offered by way of Illustration and not by 
way of limitation. 

10 

EXAMPLE 
Multiplex Detection of Lung Tumors 
A Multiplex Real-time PGR assay was established in order to 
simultaneously detect the expression of four lung cancer-specific genes: L762 
15 (SEQ ID NO:1), L984 (SEQ ID NO:3), L550 (SEQ ID N0:5) and L552 (SEQ ID 
NO:7). in contrast to detection approaches relying on expression analysis of 
single lung cancer-specific genes, this Multiplex assay was able to detect ail lung 
tumor samples tested and analyze their combined mRNA expression profile in 
adenocarcinoma, squamous, small cell and large cell lung tumors. L552S and 
20 L550S complement each other in detecting predominantly adenocarcinomas, 
L762S detects squamous cell carcinomas and L984P detects small cell 
carcinomas (see Table 1 ). 

The primers and probes were designed to be intron spanning (exon 
specific) to eliminate any reactivity with genomic DNA making them suitable for 
25 use in blood samples without having to DNAse treat mRNA samples. They were 
also designed to produce amiicons of different sizes to allow gel differentiation of 
end products if necessary. 

The assay was carried out as follows: L552S (SEQ ID NO: 7), L550 
(SEQ ID NO: 5), L762 (SEQ ID NO: 1 ). L984 (SEQ ID NO: 3) and specific primers, 
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and specific Taqman probes, were used to analyze their combined mRIMA 
expression profile in iung tumors. The primers and probes are shown below: 

L552S: Forward Primer (SEQ ID NO:9): 5' GACGGCATGAGCGACACACA. 
Reverse Primer (SEQ ID NO:10): 5' CCATGTCGCGCACTGGGATC. Probe 
5 (SEQ ID NO:1 1 ) (FAM-5' - 3'-TAMRA): 

CTGAAAGTCGGGATGCTACACCTGGGCA. 

L550P: Fonward Primer (SEQ ID NO:12): 5* GGCCACCGTCTGGATTCTTC. 
Reverse Primer (SEQ ID NO:13): 5' gaagaatccagacE^tggcc. Probe (SEQ ID 
NO:14) (FAM-5' - 3'-TAMRA): CCGCCCCAAG ATCAAATCGA CAAACC. , 

10 L762S: Fonward Primer (SEQ ID NO:15): 5' atggcagaggctgacagactc. Reverse 
Primer (SEQ ID N0:16): 5' TTCAACCAGCTCAAATCCTTTCTTA. Probe (SEQ ID 
NO:17) (FAM-5' - 3'-TAMRA) TCGACAGCAAAGGAGAGATCAGAGCCC. 

L984P: Forward Primer (SEQ ID NO:18): 5' TTACGACCCGCTCAGCCC. 
Reverse Primer (SEQ ID NO:19): 6' ctcccaacgccactgacaa. Probe (SEQ ID NO:20) 
1 5 (FAM-5' - 3'-TAMRA): CCAGGGCGAGCCCCTCAGAAGC. 

The assay conditions were: 
Taaman protocol (7700 Perkin Elmer): 

In 25 final volume: 1x Buffer A, 5mM MgCI, 0.2 mM dCTP, 0.2 mM 

20 dATP. 0.4 mM dUTP. 0.2 mM dGTP, 0.01 U/jil AmpErase UNG, 0.0375 U/^U 
TaqGold, 8% (v/v) Glycerol, 0.05% (v/v) (Sigma). Gelatin, 0.05% (v/v) (Sigma), 
Tween20 0.1 % v/v (Sigma), 300mM of each forward and reverse primer for 
L762P, 50mM of each fooA/ard and reverse primer from (L652S, L984P, L550S, 
L984P) 2 pmol of each gene specific Taqman probe (L552S. L550S. L984P) and 

25 template cDNA. The PGR reaction was carried out at one cycle at 950C for 10 
minutes, followed by 50 cycles at 950C for 1 5 seconds, 60OC for 1 minute, and 
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680C for 1 minute (ABi Prism ZSOOHO Sequenoe Detection System, Foster City, 
CA). 

Since each primer set in the multiplex assay results In a band of 
unique length, expression signals of the four genes of interest was measured 
individually by agarose gel analysis. The combined expression signal of ail four 
genes can also be measured in real-time on an ABI 7700 Prism sequence 
detection system (PE Biosystems, Foster City, CA). Although specific primers 
have been described herein, different primer sequences, different primer or probe 
labeling and different detection systems could be used to perform this multiplex 
assay. For example, a second fluorogenic reporter dye could be incorporated for 
paraliel detection of a reference gene by real-time PGR. Or, for example a SYBR 
Green detection system could be used instead of theTaqman probe approach. 
Table 2 shows the reactivity of the multiplex PCR with different lung tumor types 
and nomnal lung tissue. 

TABLE 2 Expression of Lung Cancer Multiplex Genes (L762P, L552S, L550S, 
L984P) in Lung Tumor and Normal Lung 



Lung Tumor Type 


Positive Samples/Samples Tested 


Adenocarcinoma 


21/24 


Squamous 


17/18 


Large Cell 


5*/5 


Small Cell 


5/6 


Normal Lung Tissue 


0/12 


Total Tumors 


48/53 


% Positive Tumors 


90.57% 



20 * One sample at cut-off 



From tiie foregoing It will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of 
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illustration, various modifications may be made without deviating from the spirit 
and scope of the Invention. Accordingly, the invention Is not limited except as by 
the appended claims. 
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CLAIMS 

We Claim: 

1 . A method for detecting tlie presence of a cancer cell In a patient, 

said metiiod comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting the biological sample with two or more oligonucleotide 
pairs specific for Independent polynucleotide sequences which are unrelated to one 
another, wherein the oligonucleotide pairs hybridize, under moderately stringent 
condiUons. to their respective polynucleotides and the complements thereof; 

(c) amplifying said polynudeotides; and 

(d) detecting said amplified polynucleotides; 

wherein the presence of one or more of said amplified polynucleotides 
indicates the presence of lung cancer cells in said patient 

2. A method for detennining the presence of lung cancer cells In a 
patient, said method comprising the steps of: 

(a) obtaining a biological sample from said patient; 

(b) contacting a biological sample obtained from the patient with two or 
more oligonucleotides that hybridize to two or more polynucleotides that encode two or 

more lung tumor proteins; 

(c) detecting in said biological sample an amount of a polynucleotide 
that hybridizes to at least one of said oligonucleotides; and 

(d) comparing the amount of the polynucleotides that hybridizes to said 
oligonucleotides to a predetennlned cut-off value, and therefrom determining the 
presence or absence of lung cancer cells In the patient. 
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3. A method for monitoring the progression of lung cancer in a patient, 
said method comprising the steps of: 

(a) obtaining a first biological sample from said patient; 

(b) contacting said first biological sample with one or more 
oligonucleotides that hybridize to one or more polynucleotides that encode lung tumor 
proteins; 

(c) detecting in said first biological sample an amount of at least one of 
said polynucleotides that hybridize to said oligonucleotides; 

(d) repeating steps (b) and (c) using a second biological sample 
obtained from said patient at a subsequent point in time; and 

(e) comparing the amount of polynucleotides detected in step (d) with 
the amount detected In step (c) and therefrom monitoring the progression of lung cancer 
in said patient. 
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ABSTRACT OF THE DISCLOSURE 



Compositions and methods for tlie diagnosis of lung cancer are disclosed. 
Such methods are useful to detect eariy tumors or provide adequate stage/grade 
information or tumor specificity. Compositions may comprise one or more lung tumor 
proteins, immunogenic portions thereof, or polynucleotides that encode such portions. 
Such compositions may be used, for example, to improve lung cancer diagnosis and 
prognosis and potentially differentiate between NSCLC and SCLC. 
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SEQUENCE LISTING 

<110> Zehentner, Barbara 
Hayes I Dawn 
Houghton, Raymond L. 

<120> METHODS, COMPOSITIONS AND KITS FOR THE DETECTION 
AND MONITORING OF LONG CANCER 

<130> 210121. 609P1 

<140> US 

<141> 2003-03-24 
<160> 20 

<170> Corixa Invention Disclosure Database 

<210> 1 

<211> 3951 

<212> DNA 

<213> Homo sapiens 

<400> 1 • 

tctgcatcca tattgaaaac ctgacacaat gtatgcagca ggctcagtgt gagtgaactg 60 
gaggcttctc tacaacatga cccaaaggag cattgcaggt cctatttgca acctgaagtt 120 
tgtgactctc ctggttgcct taagttcaga actcccattc ctgggagctg gagtacagct 180 
tcaagacaat gggtataatg gattgctcat tgcaattaat cctcaggtac ctgagaatca 240 
gaacctcatc tcaaacatta aggaaatgat aactgaagct tcattttacc tatttaatgc 300 
taccaagaga agagtatttt tcagaaatat aaagatttta atacctgcca catggaaagc 360 
taataataac agcaaaataa aacaagaatc atatgaaaag gcaaatgtca tagtgactga 420 
ctggtatggg gcacatggag atgatccata caccctacaa tacagagggt gtggaaaaga 480 
gggaaaatac attcatttca cacctaattt cctactgaat gataacttaa cagctggcta 54 0 
cggatcacga ggccgagtgt ttgtccatga atgggcccac ctccgttggg gtgtgttcga 600 
tgagtataac aatgacaaac ctttctacat aaatgggcaa aatcaaatta aagtgacaag 660 
gtgttcatct gacatcacag gcatttttgt gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
ctgtattatt agtaagcttt ttaaagaagg atgcaccttt atctacaata gcacccaaaa 780 
tgcaactgca tcaataatgt tcatgcaaag tttatcttct gtggttgaat tttgtaatgc 84 0 
aagtacccac aaccaagaag caccaaacct acagaaccag atgtgcagcc tcagaagtgc 900 
atgggatgta atcacagact ctgctgactt tcaccacagc tttcccatga acgggactga 960 
gcttccacct cctcccacat tctcgcttgt agaggctggt gacaaagtgg tctgtttagt 1020 
gctggatgtg tccagcaaga tggcagaggc tgacagactc cttcaactac aacaagccgc 1080 
agaattttat ttgatgcaga ttgttgaaat tcataccttc gtgggcattg ccagtttcga 1140 
cagcaaagga gagatcagag cccagctaca ccaaattaac agcaatgatg atcgaaagtt 1200 
gctggtttca tatctgccca ccactgtatc agctaaaaca gacatcagca tttgttcagg 1260 
gcttaagaaa ggatttgagg tggttgaaaa actgaatgga aaagcttatg gctctgtgat 1320 
gatattagtg accagcggag atgataagct tcttggcaat tgcttaccca ctgtgctcag 1380 
cagtggttca acaattcact ccattgccct gggttcatct gcagccccaa atctggagga 14 40 
attatcacgt cttacaggag gtttaaagtt ctttgttcca gatatatcaa actccaatag 1500 
catgattgat gctttcagta gaatttcctc tggaactgga gacattttcc agcaacatat 1560 
tcagcttgaa agtacaggtg aaaatgtcaa acctcaccat caattgaaaa acacagtgac 1620 
tgtggataat actgtgggca acgacactat gtttctagtt acgtggcagg ccagtggtcc 1680 
tcctgagatt atattatttg atcctgatgg acgaaaatac tacacaaata attttatcac 17 40 
caatctaact tttcggacag ctagtctttg gattccagga acagctaagc ctgggcactg 1800 
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gacttacacc ctgaacaata cccatcattc tctgcaagcc ctgaaagtga cagtgacctc 1860 
tcgcgcctcc aactcagctg tgcccccagc cactgtggaa gcctttgtgg aaagagacag 1920 
cctccatttt cctcatcctg tgatgattta tgccaatgtg aaacagggat tttatcccat 1980 
tcttaatgcc actgtcactg ccacagttga gccagagact ggagatcctg ttacgctgag 2040 
actccttgat gatggagcag gtgctgatgt tataaaaaat gatggaattt actcgaggta 2100 
ttttttctcc tttgctgcaa atggtagata tagcttftaaa gtgcatgtca atcactctcc 2160 
cagcataagc accccagccc actctattcc agggagtcat gctatgtatg taccaggtta 2220 
cacagcaaac ggtaatattc agatgaatgc tccaaggaaa tcagtaggca gaaatgagga 2280 
ggagcgaaag tggggcttta gccgagtcag ctcaggaggc tccttttcag tgctgggagt 2340 
tccagctggc ccccaccctg atgtgtttcc accatgcaaa attattgacc tggaagctgt 2400 
aaaagtagaa gaggaattga ccctatcttg gacagcacct ggagaagact ttgatcaggg 24 60 
ccaggctaca agctatgaaa taagaatgag taaaagtcta cagaatatcc aagatgactt 2520 
taacaatgct attttagtaa atacatcaaa gcgaaatcct cagcaagctg gcatcaggga 2580 
gatatttacg ttctcacccc aaatttccac gaatggacct gaacatcagc caaatggaga 264 0 
aacacatgaa agccacagaa tttatgttgc aatacgagca atggatagga actccttaca 2700 
gtctgctgta tctaacattg cccaggcgcc tctgtttatt cccqccaatt ctgatcctgt 27 60 
acctgccaga gattatctta tattgaaagg agttttaaca gcaatgggtt tgataggaat 2820 
catttgcctt attatagttg tgacacatca tactttaagc aggaaaaaga gagcagacaa 2880 
gaaagagaat ggaacaaaat tattataaat aaatatccaa agtgtcttcc ttcttagata 2940 
taagacccat ggccttcgac tacaaaaaca tactaacaaa gtcaaattaa catcaaaact 3000 
gtattaaaat gcattgagtt tttgtacaat acagataaga tttttacatg gtagatcaac 3060 
aaattctttt tgggggtaga ttagaaaacc cttacacttt ggctatgaac aaataataaa 3120 
aattattctt taaagtaatg tctttaaagg caaagggaag ggtaaagtcg gaccagtgtc 3180 
aaggaaagtt tgttttattg aggtggaaaa atagccccaa gcagagaaaa ggagggtagg 3240 
tctgcattat aactgtctgt gtgaagcaat catttagtta ctttgattaa tttttctttt 3300 
ctccttatct gtgcaigaaca ggttgcttgt ttacaactga agatcatgct atatttcata 3360 
tatgaagccc ctaatgcaaa gctctttacc tcttgctatt ttgttatata tattacagat 3420 
gaaatctcac tgctaatgct cagagatctt ttttcactgt aagaggtaac ctttaacaat 3480 
atgggtatta cctttgtctc ttcataccgg ttttatgaca aaggtctatt gaatttattt 3540 
gtttgtaagt ttctactccc atcaaagcag ctttttaagt tattgccttg gttattatgg 3600 
atgatagtta tagcccttat aatgccttaa ctaaggaaga aaagatgtta ttctgagttt 3660 
gttttaatac atatatgaac atatagtttt attcaattaa accaaagaag aggtcagcag 3720 
ggagatacta acctttggaa atgattagct ggctctgttt tttggttaaa taagagtctt 3780 
taatcctttc tccatcaaga gttacttacc aagggcaggg gaagggggat atagaggtcc 3840 
caaggaaata aaaatcatct ttcatcttta attttactcc ttcctcttat ttttttaaaa 3900 
gattatcgaa caataaaatc atttgccttt ttaattaaaa acataaaaaa a 3951 

<210> 2 
<211> 943 
<212> .PRT 

<213> Homo sapiens 
<400> 2 

Met Thr Gin Arg Ser He Ala Gly Pro He Cys Asn Leu Lys Phe Val 

1 5 10 15 . 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Gly 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys Glu Met 

' 50 ' 55 - 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Val 
65 70 ' "^^ . ®^ 

Phe Phe Arg Asn He Lys He Leu He Pro Ala Thr Trp Lys Ala .Asn 
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85 90 , 95 • 

Asn Asn Ser Lys He Lys Gin Glu Ser Tyr Glu Lys Ala Asn Val He 

100 * 105 110 

Val Thr Asp Trp Tyr Gly Ala His Gly Asp Asp Pro Tyr Thr Leu Gin 

115 120 125 

Tyr Arg Gly Cys Gly Lys Glu Gly Lys Tyr lie His Phe Thr Pro Asn 

130 . .135 / 140 

Phe Leu Leu Asn Asp Asn Leu Thr Ala Gly Tyr Gly. Ser Arg Gly Arg 
145 150 .155 160 

Val Phe Val His Glu Trp Ala His Leu Arg Trp Gly Val Phe Asp Glu 

165 170 175 

Tyr Asn Asn Asp Lys Pro Phe Tyr He Asn Gly Gin Asn Gin He Lys . 

180 185 190 

Val Thr Arg Cys Ser Ser Asp He Thr Gly He Phe Val Cys Glu Lys 

195 • 200 205 

isly Pro Cys Pro Gin Glu Asn Cys He He Ser Lys Leu Phe Lys Glu 

210 215 220 

Gly Cys Thr Phe He Tyr Asn Ser Thr Gin Asn Ala Thr Ala Ser He 
225 230 ' 235 240 

Met Phe Met Gin Ser Leu Ser Ser Val Val Glu Phe Cys Asn Ala Ser 

245 250 ■ 255 

Thr His Asn Gin Glu Ala Pro Asn Leu Gin Asn Gin Met Cys Ser Leu 

260 265 270 

Arg Ser Ala Trp Asp Val He Thr Asp Ser Ala Asp Phe His His Ser 

275 280 285 ^ 

Phe Pro Met Asn Gly Thr Glu Leu Pro Pro Pro Pro Thr Phe Ser Leu 

290 295 300 

Val Glu Ala Gly Asp Lys Val Val Cys Leu Val Leu Asp Val Ser Ser 
305 310 315 320 

Lvs Met Ala Glu Ala Asp Arg Leu Leu Gin Leu Gin Gin Ala Ala Glu 

325 ' • 330 335 

Phe Tyr Leu Met Gin He Val Glu He His Thr Phe Val Gly He Ala 

340 345 350 

Ser Phe Asp Ser Lys Gly Glu He Arg Ala Gin Leu His Gin He Asn 

355 360 365 

Ser Asn Asp Asp Arg Lys Leu Leu Val Ser Tyr Leu Pro Thr Thr Val 

370 375 380 

Ser Ala Lys Thr Asp He Ser He Cys Ser Gly Leu Lys Lys Gly Phe 
385 390 395 400 

Glu Val Val Glu Lys Leu Asn Gly Lys Ala Tyr Gly Ser Val Met He 

405 410 415 

Leu Val Thr Ser Gly Asp Asp Lys Leu Leu Gly Asn Cys Leu Pro Thr 

420 425 430 

Val Leu Ser Ser Gly Ser Thr He His Ser He Ala Leu Gly Ser Ser 

435 440 445 

Ala Ala Pro Asn Leu Glu Glu Leu Ser Arg Leu Thr Gly Gly Leu Lys 

450 435 460 

Phe Phe Val Pro Asp He Ser Asn Ser Asn Ser Met He Asp Ala Phe 
465 470 475 480 

Ser Arg He Ser Ser Gly Thr Gly Asp He Phe Gin Gin His He Gin 

485 490 495 

Leu Glu Ser Thr Gly Glu Asn Val Lys Pro His His Gin Leu Lys Asn 

500 505 510 

Thr Val Thr Val Asp Asn Thr Val Gly Asn Asp Tht Met Phe Leu Val 



515 * 520 525 

Thr Trp Gin Ala Ser Gly Pro Pro Glu lie He Leu Phe Asp Pro Asp 

530 535 540 

Glv Arq Lys Tyr Tyr Thr Asn Asn Phe He Thr Asn Leu Thr Phe Arg * 
545 550 555 560 

Thr Ala Ser Leu Trp He Pro Gly Thr Ala Lys Pro Gly His Trp Thr 

565 570 575 

Tyr Thr Leu Asn Asn Thr His His Ser Leu Gin Ala Leu Lys Val Thr 

580 585 590 

Val Thr Ser Arg Ala Ser Asn Ser Ala Val Pro Pro Ala Thr Val Glu 

595 600 605 

Ala Phe Val Glu Arg Asp Ser Leu His Phe Pro His Pro Val Met He 

610 ' 615 620 

Tyr Ala Asn V^l Lys Gin Gly Phe Tyr Pro He Leu Asn Ala Thr Val 
625 630 635 640 

Thr Ala Thr Val Glu Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu 

645 650 655 

Leu Asp Asp Gly Ala Gly Ala Asp Val He Lys Asn Asp Gly He Tyr 

660 665 670 

Ser Arg Tyr Phe Phe Ser Phe Ala Ala Asn Gly Arg Tyr Ser Leu Lys 

675 680 685 

Val His Val Asn His Ser Pro Ser He Ser Thr Pro Ala His Ser He 

690 695 700 

Pro Gly Ser His Ala Met Tyr Val Pro Gly Tyr Thr Ala Asn Gly Asn 
705 710 715 720 

He Gin Met Asn Ala Pro Arg Lys Ser Val Gly Arg Asn Glu Glu- Glu 

725 730 735 

Arg Lys Trp Gly Phe Ser Arg Val Ser Ser Gly Gly Ser Phe Ser Val 

740 745 750 

Leu Gly Val Pro Ala Gly Prd His Pro Asp Val Phe Pro Pro Cys Lys 
755 760 765 

. He He Asp Leu Glu Ala Val Lys Val Glu Glu Glu Leu Thr Leu Ser 
770 775 780 

Trp Thr Ala Pro Gly Glu Asp Phe Asp Gin Gly Gin Ala Thr Ser Tyr 
785 ■ 790 795 800 

Glu He Arg Met Ser Lys Ser Leu Gin Asn He Gin Asp Asp Phe Asn 

805 810 815 

Asn Ala He Leu Val Asn Thr Ser Lys Arg Asn Pro Gin Gin Ala Gly ■ 

820 825 830 

He Arg Glu He Phe Thr Phe Ser Pro Gin He Ser Thr Asn Gly Pro 

835 840 845 

Glu His Gin Pro Asn Gly Glu Thr His Glu Ser His Arg He Tyr Val 

850 855 860 

Ala He Arg Ala Met Asp Arg Asn Ser Leu Gin Ser Ala Val Ser Asn 
865 870 * 875 880 

He Ala Gin Ala Pro Leu Phe He Pro Pro Asn Ser Asp Pro Val Pro 

885 890 895 

Ala Arg Asp Tyr Leu He Leu Lys Gly Val Leu Thr Ala Met Gly Leu 

900 905 910 

He Gly He He Cys Leu He He Val Val Thr His His Thr Leu Ser 

915 920 925 

Arg Lys Lys Arg Ala Asp Lys Lys Glu Asn Gly Thr Lys Leu Leu 
930 935 940 



5 



<210> 3 

<211> 785 

<212> DNA 

<213> Homo sapiens 

<400> 3 

tctgattccg cgactccttg gccgccgctg 
ggcggcgccg gccagcagcc ccagccgcag 
tgtttctttg ccacggccgc agccgcggcg 
gcgcagcagc agcagcagca gcagcagcag 
ggccagccct cagggggcgg tcacaagtca 
tcttcgcccg aactgatgcg ctgcaaacgc 
ctgccgcagc agcagccggc cgccgtggcg 
aagttggtca acctgggctt tgccaccctt 
aagaagatga gtaaggtgga gacactgcgc 
cagctgctgg acgagcatga cgcggtgagc 
accatctccc ccaactactc caacgacttg 
tactcgtcgg acgagggctc ttacgacccg 
ttcaccaact ggttctgagg ggctcggcct 
agcag 

<210> 4 
<211> 236 
<212> PRT 

<213> Homo sapiens 



cgcatggaaa gctctgccaa gatggagagc 60 
ccccagcagc ccttcctgcc gcccgcagcc 120 
gccgcagccg ccgcagcggc ' agcgcagagc 180 
caggcgccgc agctgagacc ggcggccgac 240 
gcgcccaagc aagtcaagcg acagcgctcg 300 
cggctcaact tcagcggctt tggctacagc 360 
cgccgcaacg agcgcgagcg caaccgcgtc 420 
cgggagcacg tccccaacgg cgcggccaac 480 
tcggcggtcg agtacatccg cgcgctgcag 540 
gccgccttcc aggcaggcgt cctgtcgccc 600 
aactccatgg ccggctcgcc ggtctcatcc 660 
ctcagccccg aggagcagga gcttctcgac 720 
ggtcaggccc tggtgcgaat ggactttgga 780 



<400> 4 , o 

Met Glu Ser Ser Ala Lys Met Glu Ser Gly Gly Ala Gly Gin Gin Pro 

15 10 15 

Gin Pro Gin Pro Gin Gin Pro Phe Leu Pro Pro Ala Ala Cys Phe Phe 

20 25 30 

Ala Thr Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Gin 

35 40 45 

Ser Ala Gin Gin Gin Gin Gin, Gin Gin Gin Gin Gin Gin Gin Ala Pro 

50 55 60 

Gin Leu Arg Pro Ala Ala Asp Gly Gin Pro Ser Gly Gly Gly His Lys 
65 70 75 80 

Ser Ala Pro Lys Gin Val Lys- Arg Gin Arg Ser Ser Ser Pro Glu Leu 

85 90 95 

Met Arg Cys Lys Arg Arg Leu Asn Phe Ser Gly Phe Gly Tyr Ser Leu 

100 105 110 

Pro Gin Gin Gin Pro Ala Ala Val Ala tog Arg Asn Glu Arg Glu Arg 

115 120 125 

Asn Arg Val Lys Leu Val Asn Leu Gly Phe Ala Thr Leu Arg Glu His 

130 135 140 

Val Pro Asn Gly Ala Ala Asn Lys Lys Met Ser Lys Val Glu Thr Leu 
145 150 155 160 

Arq Ser Ala Val Glu Tyr He Arg Ala Leu Gin Gin Leu Leu Asp Glu 

165 170 175 

His Asp Ala Val Ser Ala Ala Phe Gin Ala Gly Val Leu Ser Pro Thr 

180 185 190 

He Ser Pro Asn Tyr Ser Asn Asp Leu Asn Ser Met Ala Gly Ser Pro 

195 200 205 

Val Seir Sfer Tyr Ser Ser Asp Glu Gly Ser Tyt Asp Pro Leu Ser Pro 



210 215 220 

Glu Glu Gin Glu Leu Leu Asp Phe Thr Asn Trp Phe 
225 230 235 

<210> 5 

<211> 1633 

<212> DNA 

<213> Homo sapiens 



<400> 5 

cgtggaggca 

ccagactagc 

agacgtccgc 

cagaggtccc 

tgtccgggaa 

atcgggaaat 

ctcccaaaag 

aatccacaaa 

ataatttaaa 

agtatgagaa 

ctgctaaagt 

aggaggagga 

ttagagtagg 

attaggttta 

aattgtcagt 

aagttgtaca 

ctgtgcactt 

atttgtaagg 

tatctatagt 

gcgttgaggc 

gaggctggac 

gtatatagtg 

catgagaata 

tagaactctt 

cctgtactta 

aatgtttttg 

tcaatttctg 

ccctataaat 



gctagcgcga 
gaacaataca 
ttatgccttc 
tgtcaatttt 
agagaaatcc 
gaaggattat 
gccaccgtct 
ccccggcatc 
tgacagtgaa 
ggatgttgct 
tgcccggaaa 
ggaggaggag 
ggagcgccgt 
attacaaaat 
ggtttacatg 
tatttccaaa 
tgctgttggt 
tggtggtaac 
ttgtaaaaag 
tgtggggaag 
ctgttgactc 
acatagcatt 
ttttttttta 
catligtcagc 
aacacgattc. 
aagttaaata 
actcacagca 
gtg 



ggctggggag 
gtcgggatgg 
tttgtgcaga 
gcggaatttt 
aaatttgatg 
ggaccagcta 
ggattcttcc 
tctattggag 
aagcagcctt 
gactataagt 
aaggtggaag 
gatgaataaa 
aattgacaca 
ttgatcacga 
aagtggccat 
catttttaaa 
gtgacaaggc 
tatggttatt 
aacaaaacaa 
atgccttttg 
tgcagggggc 
ctgctgccat 
agtgcggtag 
aaagcaaaga 
gcaacgttct 
aacagtatta 
gtgaacaaac 



cgctgagccg 
ctaaaggtga 
catgcagaga 
ccaagaagtg 
aaatggcaaa 
agggaggcaa 
tgttctgttc 
acgtggcaaa 
acatcactaa 
cgaaaggaaa 
aggaagatga 
gaaactgttt 
tctcttattt 
tcatattgta 
gggtgtctgg 
atgaaaaggc 
atttaaagat 
ggctagaaat 
ccgagacaaa 
ggagaggctg 
atccatttag 
cttagctgtg 
tttttaaact 
gtcactgcat 
gttatttttt 
catttttaga 
ccccactccg 



cgcgtcgtgc 
ccccaagaaa 
agaacataag 
ctctgagagg 
ggcagataaa 
gaagaagaag 
agaattccgc 
aaagctgggt 
ggcggcaaag 
gtttgatggt 
agaacaggag 
atctgtctcc 
gagaagtgtc 
gtctctcaaa 
agcaccctga 
actctcgtgt 
gtttctggca 
cctgagtttt 
cccttgatgc 
tagctcaggg 
cttcaggttg 
gacaaagggg 
gtttgttttt 
caatgaaagt 
ttgtatgttt 
actcttctct 
ttgtatttgg 



cctgcgctgc 
ccaaagggca 
aagaaaaacc 
tggaagacgg 
gtgcgctatg 
gatcctaatg 
cccaagatca 
gagatgtgga 
ctgaaggaga 
gcaaagggtc 
gaggaagaag 
ttgtgaatac 
tgttgccctc 
gtgctctaga 
aactgtatca 
tctcctcact 
ttttcttttt 
caactgtata 
tccttgctcg 
cgtgeactgt 
tcttgtttct 
ggtcagctgg 
aaacaaacta 
tcaagaacct 
agaatgctga 
actataacag 
agactggcct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840. 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1633 



<210> 6 
<211> 200 
<212> PRT 

<213> Homo sapiens 
<406> 6 

Met Ala Lys Gly Asp Pro Lys Lys Pro Lys Gly Lys Met Ser Ala Tyr 

15 10 15 

Ala Phe Phe Val Gin Thr Cys Arg Glu Glu His Lys Lys Lys Asn Pro 

20 25 .30 

Glu Val Pro Val Asn Phe Ala Glu Phe Ser Lys Lys Cys Ser Glu Arg ^ 

35 40 45 

Trp Lys Thr Met Ser Gly Lys Glu Lys Ser Lys Phe Asp Glu Met Ala 

50 55 60 

Lys Ala Asp Lys Val Arg Tyr Asp Arg Glu Met Lys Asp Tyr Gly Pro 



7 



65 




70 






75 










80 


Ala 


Lvs Glv Glv Lvs 
85 


Lvs 


Lys Lys Asp 


Pro 
90 


Asn 


Ala 


Pro 


Lys 


Arg 
95 


Pro 


Pro 


Ser Gly Phe Phe 


Leu 


Phe Cys Ser 


Glu 


Phe 


Arg 


Pro Lys 


He 


Lys 




100 




105 










110 






Ser 


Thr Asn Pro Gly 
115 


lie 


Ser lie Gly 
120 


Asp 


Val 


Ala 


Lys 
125 


Lys 


Leu 


Gly 


Glu 


Met Trp Asn Asn 
130 


Leu 


Asn Asp Ser 
135 


Glu 


Lys 


Gin 
140 


Pro 


Tyr 


He 


Thr 


Lys 


Ala Ala Lys Leu 


Lys 


Glu Lys Tyr 


Glu 


Lys 


Asp 


Val 


Ala 


Asp 


Tyr 


145 




150 






155 










160 


Lys 


Ser Lys Gly Lys 


Phe 


Asp Gly Ala 


Lys 


Gly 


Pro 


Ala 


Lys 


Val 


Ala 


\ 

Arg 


165 






170 










175 




Lys Lys Val Glu 


Glu 


Glu Asp Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


180 




185 










190 







Glu Glu Glu Glu Glu Glu Asp Glu 
195 200 



<210> 7 

<211> 781 

<212> DNA 

<213> Homo sapiens 



<400> 7 

gcggcggagc tgtgagccgg cgactcgggt 
gagacacggc gggtaggtcc acaggcagat 
gaagaggaac cagcaggctt ccggagggtt 
ctcgaagtcg tcgtccctct catgcggtgc 
gccataacta gggaggaagg agggccgagg 
tgttgggggt atccgagtcc cagaagcacc 
cagacgggac caggagaggg acggcatgag 
agtcccagga gcccagtaat ggagagcccc 
atcctacacc tgggcagcag acagaagaag 
acatggaagg tgatctgcaa gagctgcatc 
ggttccggcg tcaaggtgaa gataatacct 
ggtgaagagc aaccacaagt ttaaatgaag 
attagatatt tgacttaaac tatctcaata 
a 



ccctgaggtc tggattcttt ctccgctact 60 
ccaactggga gttgaagtgt gagtgagagt 120 
gtgtggtcag tgactcagag tgagaaggcc 180 
cacgcccatg gaccttcttg tctcgtcacg 240 
agtggagggg ctcaggcgaa gctggggtgc 300 
tggaaccccg acagaagatt ctggactccc 360 
cgacacacac aaacacagaa ccacacagcc 420 
aaaaagaaga accagcagct gaaagtcggg 480 
atcaggatac agctgagatc ccagtgcgcg 540 
agtcaaacac cggggataaa tctggatttg 600 
aaagaggaac actgtaaaat gccagaagca 660 
acaagctgaa acaacgcaag ctggttttat 720 
aagttttgca gctttcacca aaaaaaaaaa 780 

781 



<210> 8 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Arg Cys His Ala His Gly Pro 

1 5 
Arg Glu Glu Gly Gly Pro Arg Ser 
20 

Cys Cys Trp Gly Tyr Pro Ser Pro 

.35 40 
Arg' Phe Trp Thr Pro Gin Tht Gly 

50 55 
His Thr Gin Thr Gin Asn His Thr 



Ser Cys Leu' Val Thr Ala He Thr 

10 15 
Gly Gly Ala Gin Ala Lys Leu Gly 
25 30 
Arg Ser Thr Trp Asn Pro Asp Arg 
45 

Pro Gly Glu Gly Arg His Glu Arg 
60 

Ala Ser Pro Arg Ser Pro Val Met 



65 • 70 ^75 80 

Glu Ser Pro Lys Lys Lys Asn Gin Gin Leu Lys Val Gly He Leu His 

85 ^ 90 9S 

Leu Gly Ser Arg Gin Lys Lys He Arg He Gin Leu Arg Ser Gin Cys 

100 105 110 

Ala Thr Trp Lys Val He Cys Lys Ser Cys He Ser Gin Thr Pro Gly 

115 120 125 

He Asn Leu Asp . Leu Gly Ser Gly Val Lys Val Lys He He Pro Lys 

130 135 140 

Glu Glu His Cys Lys l^Iet Pro Glu Ala Gly Glu Glu Gin Pro Gin Val 
145 150 155 160. 



<210> 9 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<400> 9 

gacggcatga gcgacacaca 

<210>.10 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ccatgtcgcg cactgggatc 



20 



20 



<210> 11 

<211> 28 

<212> DNA 

<213> Homo sapiens 

<4b0> 11 

ctgaaagtcg ggatcctaca cctgggca 

<210> 12 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ggccaccgtc tggattcttc 

<210> 13 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gaagaatcca gacggtggcc 



28 



20 



20 



<210> 14 
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